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disposed to the idea that both causes' and effects of educational 

phenomena are inTierently multivariate In nature (see, for example, 

TatfS^kaVs recent review [1973] as well aS older surveys o? 

multivariate statistical applications in education and psychology 

cbntalned in books by Cattell [1962] and Whitla [1968]). In their 

statistical applications, however ,^ducational researchers have 

apparently taken only half of this statement seriously. Thus, 

researchers have recognized the fact that causes of educational 

phenomena are multidimensional and have employed techniques of 

multiple lin^r regression with increasing levels of sophistication 

to investigate both the combined effects of, and interrelationships 

among, multiple independent variables. For the most part, however, 

researchers have not investigated the ' multivariateness^' of ' educat ional 

Outcomes. Instead, the typical approach has been to study the effects 

of a common set ot independent variables on each of several criterion 

variables separately. 

'This practice Js not unique to education. Lana and Lubin (1963), 

for instance, reviewed the published articles in over three years 

worth of three APA journals' in an effort to discover the frequency 

with which multiplfe-cr iteria designs were used and how they were 

analyzed. One of their findings, of Interest here, was that about 

- * * 

one-third of the studies used designs which involved multiple, 
correlated criteria; yet only one of the studies reviewed took the 
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correlation among the criterion variables into account. Multiple 
criteria were usually analyzed by applying analysis of variance 
techniques to each variable separately. 

In preparing for this paper, I did a quick, informal review 
of-' the studies publis^hed in the last three years in the i^erican 
Educational Research' Journal and the Journal of Educational 

Psychology . Over 80 percent of the experimental" and comparative 

s 

studies reported in the^ journals investigated effects on multiple 
criteria. Although several of these studies employed multivariate 
techniques, the majority of them (approximately threes-fourths) 
employed multiple univariate analyses of variance. 

The typical- example would be a study in which several independei 
variabiles (including p|:etests, demographic data,, and incidence of 
treatment measures) are examined separately for their effect on, 
say, measures pf motivatioii, achievement , 'a*nd attitude toward school 
variables that few would deny are correlated (perhaps highly) in the 
population. TJi^^odfsible ramifications of performing separate . 
Univariate analyses on correlated criteria are well-known (e.g.. 
Hummel and Sligo [1971]). If a Type I error occurs ;tn tests 
involving one criteria, the probability is greater than a that it 
will occur in the tests involving the other criteria also. This 
fact, by itself, should prqj/ide sufficient motivation to the 
researcher to seek multivariate techniques for the analysis of 
multiple-criteria desi^s. At\the very least a multivariate 
technique offers the researcher a procedure for controlling 
experiment-^wise probability of a Type I error. * 

y 2 ■ 



*pf'cour8e,( thete .are other reasons for cabling for an increased 
"^application of multivariate methods. * For instance. Snow (^97A) 
has recently argued that research designs in education need to be- 
more represefitative of possible outcomes. That is, researchers need 
'^to choose samples of dependent variables which are representative of 

the phenomena being studied. Snow argues ef f ectively^h^t if the 
' results of educational e.xperimpnts are jgP be generalizable ; they must 
first be representative. Part .of Snow's thesis can be interpreted ^ 
as a call for multivariate investigations in educational research. 
Multivariate statistical procedures appropr iatel^^fi-t this epis^- - ^ 
temologfca.1 point of view that the "effects" in educational settings 
.are rarely, if ever, unldimensior&l and' therefore should not be ^ 
studied in isolatiL. The study of multiple outcomes eimultaneoualy ^ 
can afford the researcher an opportunity to "uncover" complex 
relationships among treatmei^t and outcome variables that might other- 

'wise go undiscovered. An ^xample might bftya situation 4n which It is 

\ - , ^ 

found that two experimental insJfructional prbgrams lead not 0T)ly to 

increases in several measures of achievement aiiA njotivation, but also, 
under one of the programs^ to increases in the correlation among the 
' variables'. SxM a finding might have important Implications for 
understanding the dynamics of the instructional programs. 

. If the need for greater application of multivariate statistl.cal 
procedures can be established so easily, why then are these procedures 
not used more frequently in educational research? Although it is 
Impoe^-ble to answe^r this question completely, there is no. doubt 
that at least part of the answer lies in the fpct that multivariate 

3 
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- procedures are more complex^ both mathematJLcally and conceptually, 
then univariate procedures. Until recently, researchers who so^^ght/ 
to use multivariate techniques had to first develop a fairly high 
level of mathematical skill to be able to read the exlstltig text- 
books. Even "given* the requisite mathematical ability, the ^ 
existing textbooks were often legs than useful to th^ applied 
researcher since they contained a paucity of examples on anything 
even bearing a passing resemblance -to the kinds of problems 
encountered ^ educational research, • • 

In recent years ,) however , a number of Intermediate-level (in 
^.mathematics) books on multivariate statistics have been published 
(e.g.. Coo ley & ;,ohn^ fl^pj^ Finn [1974], Harris [1975], Morrison 
[1967], Press [1972]! T,atsuokff\ll971] , and Van de Geer [1971]) 
Although these books usually recJjiMre some ability In matrix algebra, 
the mathematical development^^,-.fe^d to be highly tractable. More 
than that, however, these books provide a rich source of examples 
of applications of multivariate procedures. It Is likely that,vdue 

■ / 

to these bodks alone^, many more applications of multivariate 
procedures will appear In the published! educational research' 



V 

*^lterature In the 



next few years. 



Another development of recent years that Is likely to lead 
to a greater use of multivariate procedures Is^the Increasie in the 
number of computer programs that have become available. Many of the 

<^ 

books listed above (especially In Harris [1975]^ Press ^W^^^ ' ^ 
and Cooley & Lohne^s [1971]) provide references, to existing computer 
programs, ThCj^allablllty of these programs has further lessened the 
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demand on applied researchers for ttiatheqatlcal sophistication. 
Of course, the researcher still neecls a conceptual unlets tanding 
of^ the particular procedures he intends to use. Providing the 
researcher h^^a clear understanding of what he wants to do 

statistically, as well as a conceptual, understanding of the^ 

•> ^ * - . ' 

Statistical technique* he inTCnds to employ, then given a computer 

program for the statistical procedure, a multivariate analysis «^ 
is rel<d:ively straight-forward. This is especially true for most 
applications of the multivariate gerieral linear hypothesis (MGLH) , r' 
the subject of this paper to which we now turn. ^. 

The MGLH is the most general of all parametric linear 
statistical, procedures.. In fact^ all linear statistical tests 
(univariate and multivariate) can be developed as special subclasses 
of MGLH theory. This includes , among other" procedures , factor 
analysis, discriminant analysis, pi^ediction (or regression) analysis, 
and the analysis of variance and covariance. Although the mathe- 
matical development of MGLH theory is complex, its conceptualization, 
at least insofar as the most common applied situations are 'concerned*f 
is not particularly difficult. It does require an elementary 
facility for matrix algebra, however. ^ simple statement of the 
theory is that if a set of dependent varJLobles, Y (where the tilde 



underscore denotes a matrix), is linearly related to a set of 
independent variables, X, by the equation, 

Y - X6 + E, ' i [1.1] 

where 8 is 'a set of unknown parameters of interest, then any 
hypothesis of the form 
H : A8C D ' 
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is testable. In this equation, the matrices A and C are used to select 
particular elements from among rows and columns of 6- The matrix, D, 
is a matrix of constants specified by the. Investigator. Usually p ^ 
Is set equal tcf 0, a matrix of zeros. ' 

In the next section a brief Introduction to the theory of the 
MGLH ±s provide*. ReadersNfho wish to pursue this development In 
greater^epth are encouraged to consult the appropriate chapters In 
Kempthorne.(1952), Kulbach (1968), Metidenhall .(1968) , and Seal ^(196A). 
Early p^ipers by Smith, Gnanadeslkan, and Hughes (1962) and Bock (1963b) 
develop the MGLH In a way that Is useful to those 'who may wish fo^ 
program the computations. Thorough, but highly mathematical, 
presentations of the MGLH have been given by Anderson (1958), Rao (1965), 
and Seber (1966). 

J 

In the third section of this paper, several geneiral algebraic 

) 

examples of typical applications have been provided. Other examples 
of applications. of the MGLH, using real data, can be found In Bock 
(19^3a), Bock: and Haggard (1968), Jones (1966), and Fln^ (1974). ^ 
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2, TntvoduQtion to the Theory of the MGLH 

In tJils paper, Interest is primarily focused upon the analysis 
of variance and the analysis of CQvarlance. Repeated measures (or 
profile) analyses of variance and covar lance are also covered 
but are treated as^ specilal lpstat\ces of multivariate analyses of ^ 
variance and covarlandeA Before proceeding to Illustrative application 
of the MGLH, however. It Is first necessary to lay some of the 
mathematical ground work. 

Definition of the Model * ^ ' 

We begin by rewriting the general equatlo^n for a multivariate 
linear model,* \ 

where the component matrices are described In the paragraphs which 
follow. 

Y Is an N X p matrix consisting of 1 x p response row vlictors - 
for each of N subjects.^ An element/ In Y,V^enpted as , would con- 

stltute the £.'th response measure taken"^ the i'th individual In 
^oup j. 

^ Is an N X q matrix of predictor and/or design varlablesT In 
multiple linear prediction, X would contain fhe 1 x q vectors 
predictor scores for each of the N subjects. In analysis ^f variance, 
X would be a matrix f)f design variables. In the analysis of covarlance 
X wauld contain h combination of predictor scores and design vai;lables. 

B Is a q X p matrix of unknots parametoc^, the elements which 
are of Interest In tests of MGLHs. Pependlng upon the choice of X, 




the elements of § may represent subgroup tpi^anfe (expected - values)., ' 
contrasts among subgroup means, population regression coefficients, 
etc. 

E i« an N X p matrix whose columns, E , 4re error vectors for 
the p response measures. ^Stated differently, for each row vector of 



response observations, Y^j , there Is a corresponding row vector, 
E^j, of disturbances usually due to errors of measureigent and lack 
of linear fit to vthe model. 

V 

Aaewjiptione 

The as'sumptlons appllcableN^ the model defined above, for 
purposes of testing hypotheses, are the following: ^ 

1. The model Is linear In terms of the parameters, 

L 

6. 

2. N ? p + q.. 

3- -X Is of full rank, q. , ^ 

^ A. Each row vector, E^^j, In E, Is Independently 

sampled and distributed multlvatlate normally (MVN) with 

r 

expected value, 0' and covariance matrix Z 
The first assumplilon is rarely limiting in educational research and 
evaluation. Many models which appear non-linear on first sight 
are actually intrinaioally linear. In these situations, suitable 
linear m<^dels can be written following acceptable trans'- , _ 
formations of the original measures (see Draper and Smith, 1968; 
ch. 5, for a discussion of thesie types of models). Assumption 



nunfcer 2 Is required to ensure the availability of a sufficient 

nutiter of observations to estlinpta the pq elements In B. • , ^ 

The third asstimptl^ Is necessary to ensure a unique solution 
for 6. Since this solution requlresr the Inverse of X'J, It Is 



V 



necessary that X'X be nonslngular; hence X must be of full ra;ik- 

More general, non-unique solutions for g exist. These solutions, 
which use generalized Inverses of X'X, allow X to have more columns 
than Its rank (I.e., they allow J( to be less than full ran)c) . Since 
full-rank design matrices are usually quite easily constructed 
fH>t roost designs In educational research, the generalized Invers^ 
approach Is rarely, If ever, neede'tj. Arvyway, for any design matrix 
less than full rank, there always exists a transformation mat;flx, 

T 

J 

T, such that ' 
X* « XT 

l8 of full rank (see" Bock [1963], Grayblll [1961; pp. 235-239]', or 
Snl^th [1972], for details on computing T) . 



The fo.urth assumption provides the foundation for the theory 
f 

underlying tests of MGLHs. The assumption Implies th^at the 

The fourth assumption may be stated In an alternative form which 
will prove useful In "the development. Thus, If the columns In 1 
(and, correspondingly, the: columns In Y) are strung out to form the 
Np X 1 vector (and V^X theni^assumptlon number A states that 
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V^.~JilVN(0, 5 J) ' 

V ^ MVN(V , E X. I) [2.21 

where 0 Is an Np x 1 v^tor of zeros, Vj^ is an Np x 1 vector 

of expected values,^ V^, I Is an N x N Identity matrix, and the 
symbol, x. , denotes the Kronecker direct product (Cornish [1957], 
Searle [1966, pp. 215-220]; Vartak [1955]). Since the paxameters 
In B are linear functions of Y, the variance of B and, thus, testa 
of hypothesis Involving 6 follow more or. l^ss directly. 

Eatimation of B and Z 

Since the elements o^ B and I are nit genfea:all^;;>ttown In 
practice, they must be ^tlmated. Two proce^iirea^e available, 
viz. maximum likelihood (^^.) and Least Squares (LS). Since the 
former requires the assumption bf mqltlvarlate normality wherea.s 
the latter does not, and since the 'solutions for B» are the 
same in either case^ only the LS procedure will bo pursued. 

The Ls'^^ocedure calls for obtaining estimates of 0, B 
say, such that the error sums of squares and cro'ssproducto* 
(SSCP) are a minimum, , In matrix terms, estimates of B (viz, B) 
are obtained Ouch that the following equation is a minimum. 

E'E « CY-X6)'(Y-X6>. [2.3] 



LS procedure proceeds by differentiating the right-hand side of 
Equation 2.3 with respect to' elements \*ithin rows of g and setting 
the result equal to the null matrix (a comformable matrix of zeros) 

10 

f 
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This procedure yields a system of qp simultaneous equatloaii In B 
which are Jointly Bet equal to 0. Appropriate manipulation of / 



these equations leads to th^ system of Normal Equations, 

X-'XB ° X'Y . 12-^1 



which, assuming X'*X is nonsingular, is easily solved: 

. ^ B ^ (X'X)^-'-X7. * ' ' [2 -5] 

The estimates, B, have been shovm to be Unbiased and mln-imally 
dispersed by soveral authors (c. g. ,-^AndersoA [1958], Kulback 
[1968]/ Press 11972], Rao g^65]). / ' ^ ^ 

^ A sample estimate of. £, S say, is obtalncJd-^rm ,the error 
SSCP matrix. Thus, ■ ! 

E'E « Y'Y -'B'X'XB ^ * ^ [2. 6a) 

^ . -.X,- : . 

or, since B ^ (X'X)"''^X'Y, 

E'E ^ Y"Y - B'X'Y. ^ [2.6b] 

This quantity, which we will call SS^, is than uo^d as ah estimate/ 
of i*^.» - ^ 

S - est(E) - (N-p)'"'''SSj,, ^ 12-7] 

Expeatatiffn oJvLVayianan of B ^ ' . 

From the 8olutlor> of the. normal equationr*, we obtained 

B ^ (X'X)"^X7. ' ' ' 12-5] 

VJe note that B roprofiontfi a st ralf;ht~ f orwal^J linear t ronJ^ formation 
of Y v/hich for the time being may be written , 

B ° L'Y, where 1/ - (X'X)" X'. , [2.8] 
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Taking the expect-atlon of B,. we* have 

* I * , J' 

? • * ■ 

$(B) = &(L'Y) « L^eCY). A 
•prom Equation 2.1, however, we have ^ * / 

Y = xe + E, " ' 

Under assumption the E^j in E are distributed ; 

multivariately about 01 Hertbe, •\ 

6(Y) = XB. 
Therefore, ' 

g(B) = L'XB 

= (X'X)"-'-X'XB J 

• = 6-. i 
which demonstrates the unbiasedness of B. • ^ 

To develop the covariance matrix o£ B we make use of 
operations on Kronecker products. 

From Equation [2.8] we h^ve 
. . B = L'Y. 

If the columns of Y are strung out to form the Np x 1 column 
vector V , as before, then this equation may be written as 

V, * (I X. L')V \ ' ^2.9] 

^ ^ -y 

where I is p x p acid L' is q x N. The direct product, I 
is PQ ^.-..EB^i^hus , the overall product in Equation 2.9 is pq x 1. 
From the rules pertaining to the variance of linear transformations 
of random variables, we have 
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Var [(I L')yy] 

- (I X. L')' (I X* I) (I i'^ 

= (I' X. L) (r X' I) (I ^' y ^ ' t . 

- i ^ = (rz X. LI) (I X- L') 

• = I'll X. LIL' 
= L X. (X'X)"-"-. .. 

» Thus, for the distribution- of B, we may write 

( . ■ ~ -1 . .... ■ 

B MVUCVq ;i: x. (XX)]. . 

Through similar log^ It can be shown that for linear 
toj. trane format Ions of B, - ^ ' , 

^\ ^ ». AB^MVK[(1 X. A)V ; I X- A(X'X) ^A'] > ^ . 

ABC'-MVN[(C' X. I X. A)Vg; C'lC x- A(X'X)" A']. 
These values artf useful in constructing interval estimates around 
hypothesized values of transformations of 

Sums of Squares Due to Regression 
From Equation 2.6 we have 

ss^ = Y'Y - B'X7 ^ 

. , - Y'Y - B'X'XB. 

The right-most terms in these expressions give the SSCP due to 
regression of Y on X. Typically, we let 

SS„ =■ B'X'XB s [2.10] 

I 

and call SS„ the matrix of SSCP explained by the hypothesis that 
the full model,- given in Equation [2-1] hold* for the data. SS^ 
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Is that component of the total SSCP, ^'Y, that is left unexplained 
by the model • . , 



Teets of Hypotheeie Concerning B 

Another way of def inifig SS^^ i^s to say that it is 'the difference 

in SS^ obtained by the model given in Equation 2.1 and the model 



(define^ under the null hypotljiesis, 



Usually, however, we are not part^-cularly interested in the hypothes 
that all elements tn 6 are equal to zero; instead our interest 
is usually focused upon various subhypbtjieses involving linear 
J J (transformations of the B. For instance, let the q x q matrix, A, be 

■ ' ' " 

^Kr^ part J 



partitioned as 



\ 



where A- is (r x q) and A is [(q - r) x q]. Then t^ie model given 

■ ■ ' r 

in Equation 2.1 may be writter as 
Y =^ XAB + E . 



= X^A^B +X2A2B + E 



[2.11] 



where the partitioning of X is in conformance with the partitioning 
of A. We now let 

so that If we wished, we could -write the model |,s 



f?l 



Y - 



+ E, 



14 

15 



where we ngte that and not necessarily represent direct partitions 



of the original B. 


If we let 




X'X - 


X ' 

~2 




^21 -22 









then LS estimates of and are glvfen by 

>^ -1 ~2 

?i= ?irVi(!-w- 

\ Our Interest lies In testing the subhypathesis that 

) * 

^1 ^1^ ^* say. More formally, then, the hypothesis is 



'?1 




0 


J2 




.-I 



^ where Is allowed to take on some (unrestrl^ed) value under 
the( hypothesis* Under H^, the ^odel Is^ 



12.12] 



[2.13] 



* 

The LS estimate or Is gi^en by 

?2 " ?22 ?2l : 
By steps similar to those given earlier we compute 

where we let the last term on the right be denoted by SS^» 

— ~n 

tjie SSCP exp]^lned by the restricted model under Hp (viz, the 

> 



[^.14] 



[2.15] 



[2.16J 



-Ay 
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model in 'Equatlop 2»14)» ^ 
" • Our interest is in the difference^* 



[2.17] 



where SS„ was given In fequation 2.10. This diffeorence %b given 
by ' •* 



[2.18] 



[(B^ - 0)' (!2 !2^'^ 



?11 


?12" 




'?i -9" 


u21 


?22 




!2 - ?! 




2.12 


and 


2.15 



- B*) - X, 



-1„-, 



-1, 



22 hll - 111 l2l?l - ?22"'??~ 



" - ?22'^?21?r 



Following through with the matrix operations in Equation 2.18 

^ 

leads to the result 



5\ - !l^ll!l - 5l2?22"'?21?l^ 

-1, 



-H<?ll-M22 ?21^V 
Note, however, that from matrix theory we have the important result 
tji^t the inverse of the term in the parentheses above is equivalent 



;^|X"X)"^A^i That is to say 



(?11 - ?12?22''?21>'' = ^l(?l>:'^r 
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Therefore, we have 

SSu', aa defined here, is the SSCP explained by the hypothesis 

Hq: A^e - 0. [2.20] 

In many cases, we are Interested In tests of hypotheses- 
involving transformations on the columns of ^S. In other words, 
we are interested in hypotheses of the form 

H : BC - 0. [2.21] 

Under^this hypothesis, the model in Equation [2.1] becomes 

JC - XeC + EC 
and the LS estimate of BC is g^Hren by 

\BC - (X'X)""'-X'YC 
from which we can obtain 

' SSp^ = C'Y'YC - C"B"X'XBC 

- C'SS^C - C'SS^. [2,22] 
wh^re SS„ was defined In Equation 2.10 

Similarly, It could be shown that for 'the MGLH, 



we have 



SSgp = C'SS^C - C'B'A^'[A^(X'X)"^A^ 

- C'SS^C - C'SS„ C 

T~ Hq~ ^ 

= C'[SS^ - SSj^ ]C. [2.23] 



0 
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We may sunparlze the results^obtalned thus far In this ^ 

section by writing H for C'SS„ C, E for -C^SS^^C, and A for A,, and 

- -V ^ ^ ^^^^ ^ ^ 

noting that the most general statement af an MGLH^ay'be written 
as . • • • V 

ABC = 0 • ; 

where A Is a g x r matrix of rank g whose elements, a^'^ (1^1, 

2f**i g; j - 1» 2,.**, r). are used .4o select particular 
comblhatlons from the rows of B; c Is a p x u matrix of rank a whose 
elements, c^. (1^1, 2, p; j ° 1. 2, u) , are used 

to select linear combinations among the columns of B. 



y 



Teat Criteria 

Multlvajrlate test criteria are usually a function of the 

.1 ^ 

characteristic roots of HE (or, erfulvalently, of the deter- 
mlnantal equation |h- XE| - 0). Three popular test criteria 
are given In the next three paragraphs. ^ 

Trace criterion. The trace criterion Is the trace of 
HE"*^ which Is equivalent to the sum of the roots of the 
det^rmlnantal equation, 

|H - X^E I - 0; 1-1, 2,..., u 

According to Anderson (1958; p. 224) the asymptotic distribution 
of N X (trace HE""^) Is the chl square distribution with gu degrees 
of freedom (where g " rank of A, and u = rank of C). 
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Greatest aharaatezn-Btia voofaritevion. Bils test statistic 



uses the largest characteristic root, A- , of HE • convenience 



\^ may be converted to 



1 + X, 



9 - 



tor which table!' percentage points have been given (Heck [1960]; 
have also been reproduced by Morrison [1967],}. Parameters 

X 1 

for entering -the Heck tables are, 

s ■ min (g, u) • ' 

m.- (Ig - u| - 1)I2^\- 

n - (N - r " u - l)/2, where r - rank of X, 
Wilke maximum likelihood criterion. This criterion makes 
use of the statistic 



n (1 + X.) 
1-1 ^ 



-1 



where X (1 1, 2, . . . , u) ar^ again, the characteristic roots 
of HE ^. An equivalent form of the above expression is 

|H + E |g 



With large N, a ch4. square test due to Bartlett (1951) Is available 

k 

Thus, ' ' X 

o ^[N - r .5(u - g +*1)] In A 
is distributed as chi square with gu degrees of freedom. A better 
approximation (Rao [1965]) is given by ^ 
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F - 



1 - A^/^ Sl(N-r) + g-u-11 - .5(gu-2) 

u,g,N-r . ^ 



where 



'S 



2 2 , . 
g u - 4 

2.2 T 
g + u - 5 



Under the hypothesis, F is approximately distributed as an F 
statistic^ith gu and {S [(N-r) + g-u-1] -.5(gu-2)} degrees of freedom. 

For certain* values of g and u, Anderson (1958, §8r.5.1) gives 
the appropriate exact F statistic. For example when g " 1, 

1 - A , „ N-r+l-u 
F . Uxlilizr • - ^u,N7r+l-u., 

u,l,N-r 

In the special case where the rank of A (or the rank of C) is 
equal to one, the product HE""'", has only one*non-zero character- 
istic root. In this case, the largest root, the sum of the roots, 
and the product of the (non-zero) roots are all the same, thus, 
maklnftvthe three criteria equivalent. 




^ 3. Typical Appliaatione 
Having described th^ basic equations of the MGLH, we now turn 
to «in algebraic exposition of some of the more typical models found 
in educational research. Examples of typical applications using 
real data can be found in many of the references cited earlier as 
well as in Olson (1971). In the discussion which follows, we begin 
with the model for the analyel^ of variance, move to a brief discussion 
of the analysis of covariance, and finally pVesent a discussion on the 
ana|.y8is of repeated measures designs. 
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Multivariate' Aftalysis of Vaviccnoe 

Let the configuration of Figure 1 represent . the general design 
of a 2 X 3 factorial experiment. There are nj^ experimental units 
In each cell, and for purposes of exposition It will ba assumed 



that n 



r 



11 



'12 



'23 



n, though we realize this I9 not a 



necessary restriction on the general linear model. Measurements 

on each of p dependent measures (Y ° 1, 2, . . . , p) have been 

collected on each of the 6 x n experimental units. The notation, 

(£) 

^kij * denotes, the measurement of the V t\\ dependent variable for 
the k'th subject in the ij'th treatfnent comblrtatlon. It is assumed 



that each of the N 6 x n vectors, Y, 



- 1, 2, . . . , p) , have 



been independently ffampled and follow the multlnOrmal law with 
expected valu^, )^(Yj^) , and common varlance-covariance matrix Z. 
It is possible to write the N x p observations, given in figure 1 



as the N X p supermatrix of observations, 

(2) ^ 



v(l) 



v(l) 
nil 2 



v(l) 
nil3 



yd) 

ni21 



v(l) 
ni22 



yd) 

n-t23 



n!ll 



v(2) 
n:i2 



v(2) 
n~13 



v(2) 
n~21 



v(2) 



v(2) 
ni23 



y(p) 
fill 



v(P) 
n~12 



v(P) 
nil3 



y(p) 
ni21 



y(p) 
ni22 



y(p) 
n:23 
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yd) 
111 


y(2) 

111 


yfp) 

ill 


yd) 

J 1 ^ 


.y(2) 
J. 4. ^ 


y(p) 
• • • 112 


y<l) 
1<L3 


y(2) 

^113 


y(p) 
• • *^113 


•yd) 

^211 


y(2) 

^211 • 


y(p) 

^211 


yd) 

^212 


y(2) 

^212 


... ^2\2 


yd) 

^213 


y(2) 

^213 • 


•• ^213 


• • • 

^Jill 


• • • * 

12) 
^nll • 


t • • • • 

^nll 


• • • 

(1) 

^nl2 


• • • 

w (2) 


• • • • • • 


• • • 

v-d) 
^nl3 


• • • • 

* 

y(2) 

^nl3 • 


• • • • • 

y(p) 

nl3 


^121 


^121 ; 


(d) 

.. 1^21 




(2) 
U22 


y(p) 


yd) 


v(2) 
^123 • 


y(P) 
•• ^123 


yd) 

^221 


y(2) 

^221 ' 


y(P) 

•• ^221 


* 222 


y(2) 

^222 


y(p) 


yd) 

^223 


y(2) 

^223 " 


y(P) 

•• ^223 

r 


• • • 

. yd) 

^n21 


• • • • 

y(f) 


* 

• « • • • 

y(p) 




y(2) 
^n22 


• « 1 • • • 

y(P) 

••• ^n22 


• • 


• • • 

y(2) 

^n23 


1 • • • • • 

y(p) 

^n23 

' 1 



Figure Diagram or a General 2x3 Factorial Experiment. 
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( . 

where n denotes the number of rows In each of the n x 1 submatrlces. 
An appropriate design matrix, describing the experimental d*eslgn,. 





would be the N x 


6 Helmert-r 


t vDe 


mat rlx» 


of 


rank 6 % ' 
























n^ 


. n- 


1 
n- 


1 

n- 


1 ^ 
n- 


1 

n- ^ 


/ 






n- 


n~ 


-1 
n- 


1 
n- 


-1 
n- 


1 
n~ 








n- 


n~ 


0 
n- 


-2 
n- 


0 
n-v 


-2 

n- . 


9 




X - 


n- 


n- 


1 
n- 


1 
n- 


-1 
n- 


-1 
n- 








n- 


n- 


-1 
n- 


1 
n- 


1 
n- 


-1 
n- 








n- 


n- 


0 
n- 


-2 
n- 


' 0 
n- 


2 
n- 






It can easily be 


shown 


that 


the 


solution. of 


theVormal equations 



(Equation 2.4) yields the matrix of estimates, B, shown In Figure 2. 
It Is obvlobs that the first row In B estimates the grand means jal 
the dependent varlates. The second row estimates the row effect. 
The third and fourth rows (considered jointly) estimate the column 
effect, and the final two rows, the row ^ column Interaction^. 

In multivariate analysis of variance, hypotheses of Interest 
commonly Involve only effects due to Independent variables (I.e. , 

r 

ways of classification), Hypothes^es ^concerning contrasts among 
dependent variables ate considered under profile analysis. When 
all dependent variables are to be Included, the matrix C, 
Equation 2.2A Is set equal^^ I, the Identity matrix. To test 
hypotheses of no treatment or Interaction effects, the following 
A-matrlces are constructed: 



ERIC 



The reader should be aware that when cell frequencies nre 
dlsproportlonal (I.e., the design Is unbalanced) estimates of 
row, column, and Interaction effects will be confound^^d. See 
Overall & Splegl ( 1969) or Searle (1971; pp. 138-139) tor 
procedures* on treating unbalanced designs. 
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^ CO 
CM 



rsj 
I 

St * 

+ 



CU Cl^ (N 



I I 



CO ^ ro 

^ rsj ^ rsj 
PM 'H cu rsj 

^fH^H 




rsj • 



rsj c* 
rsj f- 



rsj 



^rsj 
rsj • 

I 

rsj • 



^ CO 

rsj • 

rsj 
I 

^ OJ 
rsj • 



rsj ^ 



^ rH 

rsj • 



^ rsj 
rsj rsj 

+ 

^OJ --^fH 
oj fH rsj rsj 

I I 

-^fH 
rg fH 



V-x m ^ ro 
H rsj rsj 

rsj rsj 

I + 

^ rsj ^ rsj 

rsj fH rsj oi 



I 



rsj fH rsj rsj 



rsj 



PQ ) 
U 

tl 



CO 

c 
o 

U 

:3 

V 
0) 



I rsj 



I 



rsj 



^rsj 
fH • 

I 

^fH 
fH • 



rsj 



fH • 

iH • 

+ 
fH 
fH • 



VO 



^ rsj 
fH rsj 

+ 

iH fH iH rsj 

fH 
rH fH 



^ ro ^ n 
fH fH f^ rsj 

I><" 

CNj rsj 
I + 
rsj rsj 
iH fH iH rsj 

\>*' 

+ I 
^ fH ^ H 
fH fH fH OJ 



§ 

<u 



d 
o 



o 

CO 



fl 



0) 



2A 



^(R) 



~(C) 




■fO D^l 
[O^ 0 



0 0 0] 

0 0 OJ 

0 0 0 

10 0 



■;4 



[3-1] 



-(RC)- [0 0 0 0 1 0 
10 0 0 0 0 ll. 



By substituting these matrlc-es Into the general linear hypothesis 
framework and letting C « I, the following, hypothesesi^are tben 
tested*: 



"0,M = 




° 0 


"0,R = 


*(R) ? ? 


a 0 


"o.c' 


^(C) 8 c 




"o.AC" 


■ 6(RC) ? 


C - 0 



Vy - 



13. '21 



where 



0,R 



Is the null hypothesis that the overall grar]^ 
means equal zero; 

Is the 'null hypothesis of no row effect;. 



C null hypothesis of no column -s^f feet ; and 



H Is the null hypothesis of no Interaction. ' ^ 

At alternative way to have conlJtructed the design matrix is 
1 • 
1 

- 1 9 



X - 



[3.3] 



1 
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where again n denotes the number of rows in each of the n x 1 
submatrlces. Using this design matrix, the solution of the normal 
equations would yield ^ 



B = 



yd) 
11 



yd) 

12 



y(1) 

13 



yd) 

21 



yd) 

22 



yd) 

23 



y(2) 

''ll 



y(2) 

12 



y(2) 
13 



y(2) 
21 



y(2) 
22 



y(2) 
23 



^(P) 
11 



y(p) 
12 ' 



y(p) 

\ 

y(p) 

"21 



y(p) 

.•22 



y(p) 
23 



[3.4] 



the group means on each dependent variable. Using B as defined 

f 

above the A matrices, 



~(M) " 


n 


1 


, 1 


1 


1 


1] 


A = 

~(R) 


[1 


1 


1 


-1 


-1 


-1] 


~(C) " 


[i 


^1 
1 


0 
-2 


1 
1 


-1 
1 




'A 

~(RC) = 


[1 


-1 
1 


0 
-2 


-1 
-1 


1 
-1 


5] 



[3.5] 



may then' be used to test the hypotheses given in Equation 3.2. 
Significance tests are computed by first finding H and E and then 
applying one of the test criteria given earlier. 
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Analysis of Covarianoe 

Suppose that, prior to* experimental treatment, measurements 
on the covarlates, Z^^^ and Z^^\ were collected for each experimental 
unit in Figure 1. An appropriate matrix of Independent variables 
would then ^be constructed of both design variables and the predldtor 
variables, Z^^^ , Z^^\. Thus, corresponding to [3.3], 



1 
n~ 



1 
n~ 



n- 



1 



(1) 
'11 

(1) 



5l2 
-13 

-21 

■ JJ-) 
-22 

z(^> 

~23 



z(2) 

-11 

z(2) 

-12 

z(2) 
-13 

z(2) 

-21 
-22 

z(2) 

~23 



[3.6] 



(IL) 

where the notation, , denotes the 1 vector of measurements,^ 



within the ij^th level of classification, on the V th covariate. 
The solution of the normal equations yields the 8x p matrix of 

9 

adjusted means and regression coefficients 



B" = 



''ll . ' 11 



^*(1) Y*(2) 
12 12 



r.*(l) r=*(2) 
"23 23 



^1 1 



2 2 



Y*(P) 
11 



.Y*(P) 
12 



^23 



w 



w 



(P) 
1 * 

(P) 



[3.7] 
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where w^^\ w^^\..., ^j^^ denote the wlthln-class regression 
coefficients of Y^^^^, Y^^\..., Y^^^ on Z^^\ By constructing the 



A-matrlces, 



) 



~(M) 



~(R) 



~(C) 



[1-1 1 ,1 1 1 ; 0 0] 

[1 1 1-1 -1,-1 i 0 — 0] 

1-1 0 1-1 0 I 0 0 

1-1-2 1 1-2 ; 0 0 



~^^h 11-2-1-1 2 



; 0 0] 

i 0 o]. 



and letting C " I, the hypotheses (Equation 3.2) can then be tested, 
this time for effects adjusted for regression on Z^^^ and Z^^\ 



If the subgroups on the covarlates are computed beforehand, 

then tests of unadjusted effects can be obtained easily. We note 

(H) 

that any element In B*, B*^ say. Is the coefficient for the regression 



of Y on adjusted for regression on the Z 



For instance, the 



cell mean for group ^,2 on the first dependent variable is given by 
^12''^^ = Y^2^ - Wj^Z^-""^ - It follows that the unadjusted cell 

mean for this group is ^ 



Therefore the A-matrlbx, 




A = [0 1 0 0 0 0 I Z^-"-^ Z^^^] 
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would yield 

= (for t = 1, 2. .... p), 

the unadjusted cell means. By including the overall covari^te means 
in the last two colutnris of each of the A-matrices given earlie'r, 
unadjusted treatment effects can be easily computed. 

Profile Analysis (Repeated Measures) ^ 

Profile analysis has been aptly discussed by Marks (1968; see 
also Morrison [1967; pp. 168-197]). Essentially the problem is one 
of determining whether the shape of a mean vector is equal to that 
of another mean vector. In the two-way classification, for example, 
the problem is to determine whether mean vectors are equal for the^^^^ 
different ways of classification. This is equivalent to asking 
whether contrasts among selected dependent measures for one group 
are equal to the same contrasts for another group. Observant 
readers will notice that t'his is precisely the problem in univariate 
repeated measures analysis of variance where the measures on the 
multiple dependent variables are taken over time. In the example 
situation described in Figure 1, the researcher may wish to deter- 
mine whether differences among adjacent dependent variables can be 
considered equal for all levels of the column factor. That is, he 
may wish to simultaneously test, 4 

29 
30 



and 



V i - V 1 



,(2) 



• • X 

.(3) 



; - V 1 

• • X • • X 



• • X • • X 



rf 



,(1) .,(2)' 



.2 



- y 



,(2) .,(3) 



- y 



.2 



• ••X va*^ aaX ••<> 



where the \x ^'s represent population values. It can^ be easily shown 
• -J , 

that with either of the design matrices given earlier, Its 
corresponding solution to the normal equations, the appropriate set 
of A-matrlcea, and the p x (p-1) matrix, 



(!)• (2) 

y o - y 4 



(2) (3) 

y , ^ y- T 



2 X 



„(P-i) . „<P) 



1 
-1 



0 
1 



0 -1 



0 0 
0 0 



0 0 
0 0 
0 0 



-1 1 

0 -1 



then the MGLH 



A 6C «= 0 



will yield the appropriate hypotheses for tests on profiles or 
repeated measures* 
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Repeated Meaaurea with Covariatea 

Two d:^stlnct types of repeated measures design involving covariates 
can ^e identified (cf., Winer [1962, 1971]). In the first, measures 
on the covariates are collected prior to the onset of any treatment. 
In the second, covariate measures are collected concomitantly, in time, 
with measures on the. dependent variable. Of course, a mixture of the 
two types of designs is possible. We shall. consider each type of 
design separately. 

m 

Covariates prior to the onset of treatment. The first type of 
covariate repeated measures design might be diagrammed as follows, 



-Time 



AD „(2) 
~1 ~1 



7<h) 

ri 



,(h) 



yd) 

~1 



,(1) 



v(2) 



,(2) 



y(p) 



,(P) 



where the vertical line indicates the beginning of treatment; the 
Zj . covariate observatioits for group J; and the , measures on 

the dependent variables for group J. All vectors in the diagram are^^^ 



"j ^ ^' ^^"^^^ number of subjects in group J. A good design 
matrix for this design would be the (n^ + n2) x (h + 2) matrix 



0 „.l 

The (n^^ + n^) ^ p matrix of dependent measures, Y, would be laid 
out in the usual way. The solution to the yormal equations would 
yield the (h + ^) x p matrix of estimates, 
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\ 



^1 h 



Y*(l) 
2 2 



1 



Y*(P) 
2 



W. 



(1) 



w 



(2) 



W. 



(P) 



(1) 



w, 



(2) 



w, 



(p) 



where Y*^^^ la the j ' th group mean on variable i adjusted for all 
Independent variables. This design presents no problem, and MGLHs 
can be constructed and tested In the usual manner. 

Repeated meaeures with oovariatea measured aoncomitantly , Tlie 
second type of repeated measures designs Involving covarlates Is 
somewhat- more difficult to handle. The design, for a two-group 
classification, can be diagrammed as follows. 



Time 




AD yd) 

r2 l2 



where the Z^^^ are the j'th-group observatlotls on the covarlate at 



Time 2 
til zi 



A2) A2) 
Z2 l2 



Time p 




A?) 
^1 



^(P) Y^P^ 

r2 I2 • 



-J 



time p: the Y^^^ are the J'th-group measutes on the dependent 
J 

variable at time p. In th|s situation, the design matrix, as 
constructed In the prevlousVsltuatlon, would be Inappropriate 
since It would lead to estimates of group means on each dependent 
variable adjusted for all Independent variables including those 
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covariate measures which follow the dependent measures In time. 
' What is desired, however, are estimates of group means, on each 
dependent variable, which are adjusted only for those covariate 
measures which werjp collected at the same point in time or earlier. 
In other words, the parameter matrix of interest is 



r 



,(1) ,,(2) 
1-12 



'1-1 



1-123 



(3) 

^2-1 ^2-12 ^2-123 



,(1) „(2) 



0) 



(1) 



(JL) 



(JL) 



(2) 

1- 2 

(2) 

2- 1 



(JL) 



(JL) 



(JL) 



(3) 
'1-23 

(3) 

2- 13 

(3) 

3- 12 



(JL) 



(JL) 



(JL) 



/P) 

1- 12. 

2- 22. 

(P) 

1- 23. 

(P) 

2- 13. 

(P) 

3- U. 



(JL) 



(P) 

p.l2. . .p 1 



where 



1 • jkl. 



= population group mean on dependent variable 2. adjusted 
for Z<^>. Z^'^^. Z^^\ .... 



(JL) 



ijk. 



within-class population regression 
;oef f icien 



coefficient of Y^^^ on Z^^^ adjusted for 



To obtain estimates of this matrix a gp.nevalized (see Press 

(1972; pp. 217-^27]) multivariate linear model is used. This model 

has the form given in Figure 3. * 
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v(2) 



0 
0 



y(p) 
~1 



v(l) 
^2 



0 \ 



t • • • « • 



,(p) 



1 0 z 



(1) ,(2) 



1 o'z{^) 



7(P) 
~1 



0 1 z 



0 1 



« « • « « • 



(1) 

2 

(1) 



^2 



« • « « • 



^(2) 
^2 



f2 0 



6 + 



Figure 3. Generalized Multivariate Linear Model , 

where the » » "j ^ ^ vectors .defined in the diagram for 

the design, and the, symbols, 1 and 0, denote conformable column vectors 
of I'V and O's respectively. In the model, the matrix, 3, Is the 
desired matrix of parameters, / 

The least squares solution to the model In Figure 3 provides the 
appropriate matrix of estimates. Tests of the repeated measures 
effects are then made by formtjng MGLHs Involving the first two rows 
of B, For Instance, with 

A [1 - 1 0 0 ... 0] (where A is 1 x (p + 2)) 
1 -1 0 0 01 



0 1 -1 . . . 0 0 



0 0 0 



1 -1 



(where C is (p - 1) x p) 



the MGLH, ABC « 0, would provide an appropriate test of the null 

hypothesis of equal profiles for the two groups. 
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